Scientific and Technical Information Center

SEARCH REQUEST FORM 

Requester's Full Name: [illegible]  Examiner #: [illegible]  Date: [illegible]

Art Unit: [illegible]  Phone Number: [illegible]  Serial Number: [illegible]

Location (Bldg/Room#): [illegible]  (Mailbox #): [illegible]  Results Format Preferred (circle): PAPER  DISK
***************************************************** 

To ensure an efficient and quality search, please attach a copy of the cover sheet, claims, and abstract or fill out the following: 

Title of Invention: [illegible]

Inventors (please provide full names):

Earliest Priority Date: [illegible]

Search Topic: 

Please provide a detailed statement of the search topic, and describe as specifically as possible the subject matter to be searched. Include the
elected species or structures, keywords, synonyms, acronyms, and registry numbers, and combine with the concept or utility of the invention. 
Define any terms that may have a special meaning. Give examples or relevant citations, authors, etc., if known. 

*For Sequence Searches Only* Please include all pertinent information (parent, child, divisional, or issued patent numbers) along with the 
appropriate serial number. 



STAFF USE ONLY                Type of Search        Vendors and cost where applicable

Searcher:                     NA Sequence (#)       STN              Dialog
Searcher Phone #:             AA Sequence (#)       Questel/Orbit    Lexis/Nexis
Searcher Location:            Structure (#)         Westlaw          WWW/Internet
Date Searcher Picked Up:      Bibliographic         In-house sequence systems



=> fil reg; d que l3; d que l4; fil capl; d que l7

FILE 'REGISTRY' ENTERED AT 16:29:49 ON 27 APR 2005 

USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT. 

PLEASE SEE "HELP USAGETERMS" FOR DETAILS. 

COPYRIGHT (C) 2005 American Chemical Society (ACS) 

Property values tagged with IC are from the ZIC/VINITI data file 
provided by InfoChem. 

ACCESSION NUMBER:         2005:260218 CAPLUS Full-text
DOCUMENT NUMBER:          142:334913
TITLE:                    Multi-tumor antigen-expressing vectors for preventing
                          and/or treating melanoma
INVENTOR(S):              Berinstein, Neil; Tartaglia, Jim; Parrington, Mark;
                          Panicali, Dennis; Gritz, Linda
PATENT ASSIGNEE(S):       Aventis Pasteur Limited, Can.; Therion Biologics, Inc.
SOURCE:                   PCT Int. Appl., 88 pp.
                          CODEN: PIXXD2
DOCUMENT TYPE:            Patent
LANGUAGE:                 English
FAMILY ACC. NUM. COUNT:   1
PATENT INFORMATION:

PATENT NO.        KIND   DATE       APPLICATION NO.     DATE
WO 2005026370     A2     20050324   WO 2004-US28751     20040903
    W:  AE, AG, AL, AM, AT, AU, AZ, BA, BB, BG, BR, BW, BY, BZ, CA, CH,
        CN, CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, EG, ES, FI, GB, GD,
        GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC,
        LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, NA, NI,
        NO, NZ, OM, PG, PH, PL, PT, RO, RU, SC, SD, SE, SG, SK, SL, SY,
        TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, YU, ZA, ZM, ZW
    RW: BW, GH, GM, KE, LS, MW, MZ, NA, SD, SL, SZ, TZ, UG, ZM, ZW, AM,
        AZ, BY, KG, KZ, MD, RU, TJ, TM, AT, BE, BG, CH, CY, CZ, DE, DK,
        EE, ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PL, PT, RO, SE,
        SI, SK, TR, BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, NE,
        SN, TD, TG
PRIORITY APPLN. INFO.:                     US 2003-500572P     P  20030905
                                           US 2003-504007P     P  20030918



ED Entered STN: 25 Mar 2005

AB The present invention provides multi-antigen vectors for administration to a
patient to prevent and/or treat cancer. In particular, the invention relates
to epitope peptides and nucleic acid sequences encoding such peptides for use
in diagnosing, treating, or preventing melanoma. Provided are vectors for
preventing and/or treating melanoma co-expressing immunogenic targets selected
from NY-ESO-1, TRP-2, gp100, gp100M, a MART antigen, MART-1, a MAGE antigen,
MAGE-1, and MAGE-3. The multi-antigen vector may also encode an immune
stimulator such as a co-stimulatory mol. and/or be administered with an
adjuvant.
IT 848511-63-7P
RL: BPN (Biosynthetic preparation); PRP (Properties); THU (Therapeutic
use); BIOL (Biological study); PREP (Preparation); USES (Uses)
(amino acid sequence; multi-tumor antigen-expressing vectors for
preventing and/or treating melanoma)
RN 848511-63-7 CAPLUS
CN Cell adhesion molecule ICAM-1 (intercellular adhesion mol. 1) (human)
(9CI) (CA INDEX NAME)



SEQ   1 MAPSSPRPAL PALLVLLGAL FPGPGNAQTS VSPSKVILPR GGSVLVTCST
     51 SCDQPKLLGI ETPLPKKELL LPGNNRKVYE LSNVQEDSQP MCYSNCPDGQ
    101 STAKTFLTVY WTPERVELAP LPSWQPVGKN LTLRCQVEGG APRANLTVVL
    151 LRGEKELKRE PAVGEPAEVT TTVLVRRDHH GANFSCRTEL DLRPQGLELF
    201 ENTSAPYQLQ TFVLPATPPQ LVSPRVLEVD TQGTVVCSLD GLFPVSEAQV
    251 HLALGDQRLN PTVTYGNDSF SAKASVSVTA EDEGTQRLTC AVILGNQSQE
    301 TLQTVTIYSF PAPNVILTKP EVSEGTEVTV KCEAHPRAKV TLNGVPAQPL
    351 GPRAQLLLKA TPEDNGRSFS CSATLEVAGQ LIHKNQTREL RVLYGPRLDE
    401 RDCPGNWTWP ENSQQTPMCQ AWGNPLPELK CLKDGTFPLP IGESVTVTRD
    451 LEGTYLCRAR STQGEVTREV TVNVLSPRYE IVIITVVAAA VIMGTAGLST
    501 YLYNRQRKIK KYRLQQAQKG TPMKPNTQAT PP
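
As a cross-check on this hit-sequence display, the queried pentapeptide HKNQT can be located in the row of the sequence that starts at residue 351. The sketch below is illustrative only: the function and variable names are hypothetical, and it uses just that one row as printed in this record rather than the full sequence.

```python
# Row of the displayed sequence that begins at residue 351 (copied from the record).
ROW_START = 351
ROW = "GPRAQLLLKA TPEDNGRSFS CSATLEVAGQ LIHKNQTREL RVLYGPRLDE"
QUERY = "HKNQT"


def locate(query: str, row: str, row_start: int):
    """Return the 1-based (start, end) residue positions of query in row, or None."""
    residues = row.replace(" ", "")   # drop the 10-residue block spacing
    offset = residues.find(query)     # 0-based offset within the row, -1 if absent
    if offset < 0:
        return None
    start = row_start + offset
    return start, start + len(query) - 1


print(locate(QUERY, ROW, ROW_START))
```

Under these assumptions the query falls at residues 383 to 387 of the displayed sequence.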

CAPLUS COPYRIGHT 2005 ACS on STN
ACCESSION NUMBER:         2005:232398 CAPLUS Full-text
DOCUMENT NUMBER:          142:310962
TITLE:                    Methods for identifying genes associated with enhanced
                          resistance to HIV1 or HCV infection in primates and
                          targets for therapy
INVENTOR(S):              Messier, Walter; Sikela, James
PATENT ASSIGNEE(S):       Evolutionary Genomics, LLC, USA
SOURCE:                   U.S., 98 pp., Cont.-in-part of U.S. Ser. No. 942,252.
                          CODEN: USXXAM
DOCUMENT TYPE:            Patent
LANGUAGE:                 English
FAMILY ACC. NUM. COUNT:   10
PATENT INFORMATION:

PATENT NO.        KIND   DATE       APPLICATION NO.     DATE
US 6866996        B1     20050315   US 2002-98600       20020314
US 6228586        B1     20010508   US 1999-240915      19990129
US 6280953        B1     20010828   US 2000-591435      20000609
US 2005037400     A1     20050217   US 2004-883576      20040630
PRIORITY APPLN. INFO.:              US 1998-73263P      P   19980130
                                    US 1998-98987P      P   19980902
                                    US 1999-240915      A2  19990129
                                    US 2000-591435      A2  20000609
                                    US 2001-942252      A2  20010828
                                    US 2002-98600       A2  20020314
                                    US 2003-484030P     P   20030630
                                    US 2004-545604P     P   20040217



ED Entered STN: 17 Mar 2005

AB The present invention provides methods for identifying evolutionarily
significant polynucleotide and polypeptide sequences in human and/or non-human
primates which may be associated with a physiol. condition, such as enhanced

=> d his full

(FILE 'HOME' ENTERED AT 16:18:57 ON 27 APR 2005)

FILE 'LREGISTRY' ENTERED AT 16:19:12 ON 27 APR 2005
L1 0 SEA ABB=ON HKNQT/SQSP
L2 6 SEA ABB=ON HKNQT/SQSFP
D SQIDE 1-6

FILE 'REGISTRY' ENTERED AT 16:19:50 ON 27 APR 2005
L3 433 SEA ABB=ON HKNQT/SQSP
SAVE TEMP L3 RUS124SEQ1/A
L4 0 SEA ABB=ON L3 AND SQL<11

FILE 'CAPLUS' ENTERED AT 16:20:37 ON 27 APR 2005
L5 173 SEA ABB=ON L3
L6 168797 SEA ABB=ON CELL?/OBI (L) (ADHE?/OBI OR GROW?/OBI OR SECRET?/OBI)
L7 52 SEA ABB=ON L5 AND L6

FILE 'REGISTRY' ENTERED AT 16:22:38 ON 27 APR 2005 

FILE 'REGISTRY' ENTERED AT 16:23:00 ON 27 APR 2005 
D QUE 

FILE 'REGISTRY' ENTERED AT 16:23:22 ON 27 APR 2005 
D QUE L3 
D QUE L4 

FILE 'CAPLUS' ENTERED AT 16:23:22 ON 27 APR 2005 
D QUE L7 

D IBIB ED ABS HITSEQ L7 
D IBIB ED ABS HITRN 2-52 
SEL HIT RN L7 1-52 

FILE 'REGISTRY' ENTERED AT 16:29:49 ON 27 APR 2005 
D QUE L3 
D QUE L4 

FILE 'CAPLUS' ENTERED AT 16:29:49 ON 27 APR 2005 
D QUE L7 

D IBIB ED ABS HITSEQ L7 1-52 
FILE 'HOME' ENTERED AT 16:30:39 ON 27 APR 2005 
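
The keyword query in the history above combines right-truncated terms (CELL?, ADHE?, GROW?, SECRET?) with OR and the (L) operator. The following is a rough sketch of that logic and not a reimplementation of STN: it assumes "?" means unlimited right truncation, and it simplifies (L) to "both sides occur in the same index entry". The helper names are hypothetical.

```python
import re


def truncated_term(term: str) -> re.Pattern:
    """Compile a term such as 'ADHE?' into a case-insensitive word-prefix pattern."""
    stem = term.rstrip("?")
    tail = r"\w*" if term.endswith("?") else ""
    return re.compile(r"\b" + re.escape(stem) + tail + r"\b", re.IGNORECASE)


CELL = truncated_term("CELL?")
ALTERNATIVES = [truncated_term(t) for t in ("ADHE?", "GROW?", "SECRET?")]


def entry_matches(entry: str) -> bool:
    """True when CELL? and at least one alternative occur in the same entry."""
    return bool(CELL.search(entry)) and any(p.search(entry) for p in ALTERNATIVES)


print(entry_matches("Cell adhesion molecule ICAM-1"))
print(entry_matches("cell culture system"))
```

The final answer set is then simply the intersection of the sequence-derived set with the entries that pass this keyword filter.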



FILE HOME 
FILE LREGISTRY 

LREGISTRY IS A STATIC LEARNING FILE 

NEW CAS INFORMATION USE POLICIES, ENTER HELP USAGETERMS FOR DETAILS. 
FILE REGISTRY 

Property values tagged with IC are from the ZIC/VINITI data file 
provided by InfoChem. 



STRUCTURE FILE UPDATES: 26 APR 2005 HIGHEST RN 849322-79-8 



DICTIONARY FILE UPDATES: 26 APR 2005 HIGHEST RN 849322-79-8 



New CAS Information Use Policies, enter HELP USAGETERMS for details. 

TSCA INFORMATION NOW CURRENT THROUGH JANUARY 18, 2005 

Please note that search-term pricing does apply when 
conducting SmartSELECT searches. 

*********************************************************************
*                                                                   *
* The CA roles and document type information have been removed from *
* the IDE default display format and the ED field has been added,   *
* effective March 20, 2005. A new display format, IDERL, is now     *
* available and contains the CA role and document type information. *
*                                                                   *
*********************************************************************

Crossover limits have been increased. See HELP CROSSOVER for details. 

Experimental and calculated property data are now available. For more 
information enter HELP PROP at an arrow prompt in the file or refer 
to the file summary sheet on the web at: 
http://www.cas.org/ONLINE/DBSS/registryss.html

FILE CAPLUS 

Copyright of the articles to which records in this database refer is 
held by the publishers listed in the PUBLISHER (PB) field (available 
for records published or updated in Chemical Abstracts after December 
26, 1996), unless otherwise indicated in the original publications. 
The CA Lexicon is the copyrighted intellectual property of the 
American Chemical Society and is provided to assist you in searching 
databases on STN. Any dissemination, distribution, copying, or storing 
of this information, without the prior written consent of CAS, is 
strictly prohibited. 

FILE COVERS 1907 - 27 Apr 2005 VOL 142 ISS 18 
FILE LAST UPDATED: 26 Apr 2005 (20050426/ED) 

New CAS Information Use Policies, enter HELP USAGETERMS for details. 

This file contains CAS Registry Numbers for easy and accurate 
substance identification.

=>



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on:          April 27, 2005, 18:56:18 ; Search time 38 Seconds
                 (without alignments)
                 12.660 Million cell updates/sec

Title:           US-09-992-124C-34
Perfect score:   29
Sequence:        1 HKNQT 5

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        283416 seqs, 96216763 residues

Total number of hits satisfying chosen parameters:     283416

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       PIR_79:*
                 1: pir1:*
                 2: pir2:*
                 3: pir3:*
                 4: pir4:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
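
The header values can be reproduced by hand. The sketch below assumes that the perfect score is the sum of the BLOSUM62 self-match scores of the query residues and that the Query Match percentage is the hit score divided by that perfect score; both assumptions agree with the figures printed in this report, but they are inferences, not a documented GenCore definition. Only the five diagonal BLOSUM62 entries needed for HKNQT are included, and the function names are hypothetical.

```python
# Diagonal (self-match) BLOSUM62 scores for the five query residues only.
BLOSUM62_SELF = {"H": 8, "K": 5, "N": 6, "Q": 5, "T": 5}


def perfect_score(query: str) -> int:
    """Score of the query aligned against itself with no gaps."""
    return sum(BLOSUM62_SELF[residue] for residue in query)


def query_match_percent(score: int, query: str) -> float:
    """Hit score as a percentage of the perfect score, to one decimal place."""
    return round(100.0 * score / perfect_score(query), 1)


print(perfect_score("HKNQT"))
for score in (29, 26, 25):
    print(score, query_match_percent(score, "HKNQT"))
```

With a five-residue query the percentage can only take a few discrete values, which is why the listing falls into blocks of hits sharing the same score.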

SUMMARIES

Result          Query
   No.  Score   Match  Length  DB  ID           Description
     1     29   100.0     278   2  A84545       hypothetical prote
     2     29   100.0     320   2  T30894       lipase homolog TO
     3     29   100.0     532   1  A29849       intercellular adhe
     4     29   100.0     677   2  G69895       formate dehydrogen
     5     29   100.0     989   2  T15397       hypothetical prote
     6     29   100.0    1020   2  G88208       protein K02A2.3 [i
     7     29   100.0    1075   1  OYRTHX       heat-stable entero
     8     29   100.0    1516   2  T41235       probable myosin he
     9     29   100.0    2201   2  A54774       ATP binding casset
    10     26    89.7     167   2  AC0481       probable gluconoki
    11     26    89.7     184   2  G83591       hypothetical prote
    12     26    89.7     279   2  S55105       hypothetical prote
    13     26    89.7     326   2  C86872       UDPglucose 4-epime
    14     26    89.7     351   2  T19372       hypothetical prote
    15     26    89.7     393   2  S48381       RRD1 protein - yea
    16     26    89.7     411   2  D88504       protein B0361.7 [i
    17     26    89.7     416   2  G86488       protein T32E20.31
    18     26    89.7     481   2  JC7961       proton-coupled ami
    19     26    89.7     519   2  T22091       hypothetical prote
    20     26    89.7     530   2  F84905       probable cytochrom
    21     26    89.7     717   1  VGBE11       glycoprotein H pre
    22     26    89.7     779   2  C96805       hypothetical prote
    23     26    89.7     783   2  T18421       hypothetical prote
    24     26    89.7     873   2  S46584       probable membrane
    25     26    89.7     890   2  T22186       hypothetical prote
    26     26    89.7    1022   2  E84792       probable kinesin h
    27     26    89.7    1029   2  T00712       protein kinase hom
    28     26    89.7    1139   2  A49370       E1A-associated cvc
    29     26    89.7    1291   2  T21694       hypothetical prote
    30     26    89.7    1435   2  S59384       hypothetical prote
    31     26    89.7    1577   2  A35140       hemolysin A precur
    32     26    89.7    2588   2  T14342       NSD1 protein - mou
    33     25    86.2      20   2  PN0115       insulin-like growt
    34     25    86.2      63   2  S70821       H+-transporting tw
    35     25    86.2     107   2  G85676       unknown protein en
    36     25    86.2     107   2  C90817       hypothetical prote
    37     25    86.2     121   2  H89903       conserved hypothet
    38     25    86.2     193   2  S63689       dehydrin-like prot
    39     25    86.2     227   2  [illegible]  [illegible]
    40     25    86.2     233   2  T39948       hypothetical prote
    41     25    86.2     240   2  S59085       ribosomal protein
    42     25    86.2     248   2  T26584       hypothetical prote
    43     25    86.2     258   2  B49597       nonstructural prot
    44     25    86.2     262   2  T22489       hypothetical prote
    45     25    86.2     264   2  F89833       teichoic acid tran



ALIGNMENTS 



RESULT 1
A84545
hypothetical protein At2g16850 [imported] - Arabidopsis thaliana
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 02-Feb-2001 #sequence_revision 02-Feb-2001 #text_change 16-Feb-2001
C;Accession: A84545
R;Lin, X.; Kaul, S.; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, C.D.;
Fujii, C.Y.; Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V.; Buell,
C.R.; Ketchum, K.A.; Lee, J.J.; Ronning, C.M.; Koo, H.; Moffat, K.S.; Cronin,
L.A.; Shen, M.; VanAken, S.E.; Umayam, L.; Tallon, L.J.; Gill, J.E.; Adams,
M.D.; Carrera, A.J.; Creasy, T.H.; Goodman, H.M.; Somerville, C.R.; Copenhaver,
C.P.; Preuss, D.; Nierman, W.C.; White, O.; Eisen, J.A.; Salzberg, S.L.; Fraser,
C.M.; Venter, J.C.
Nature 402, 761-768, 1999
A;Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana.
A;Reference number: A84420; MUID:20083487; PMID:10617197
A;Accession: A84545
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-278 <STO>
A;Cross-references: GB:AE002093; NID:g3757514; PIDN:AAC64216.1; GSPDB:GN00139
C;Genetics:
A;Gene: At2g16850
A;Map position: 2
C;Superfamily: lens fiber membrane major intrinsic protein

Query Match 100.0%; Score 29; DB 2; Length 278;
Best Local Similarity 100.0%; Pred. No. 22;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy  1 HKNQT 5

Db 59 HKNQT 63

Search completed: April 27, 2005, 19:05:48
Job time : 42 secs
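
The "Million cell updates/sec" figure in the run header appears consistent with counting one cell update per pairing of a query residue with a database residue. That reading is inferred from the printed numbers rather than taken from GenCore documentation, and the function name below is hypothetical.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int, seconds: float) -> float:
    """Cells of the alignment matrix filled per second, in millions."""
    return query_len * db_residues / seconds / 1e6


# PIR run: 5-residue query, 96216763 database residues, 38 seconds of search time.
print(f"{million_cell_updates_per_sec(5, 96_216_763, 38):.3f}")
```

The same arithmetic can be applied to any other run by substituting its residue count and search time.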

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on:          April 28, 2005, 12:07:02 ; Search time 176 Seconds
                 (without alignments)
                 14.548 Million cell updates/sec

Title:           US-09-992-124C-34
Perfect score:   29
Sequence:        1 HKNQT 5

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        1612378 seqs, 512079187 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries

Database :       UniProt_03:*
                 1: uniprot_sprot:*
                 2: uniprot_trembl:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

Result          Query
   No.  Score   Match  Length  DB  ID           Description
     1     29   100.0     130   2  Q972R5       Q972r5 sulfolobus
     2     29   100.0     169   2  Q6YQ99       Q6yq99 onion yello
     3     29   100.0     169   2  Q8A3B7       Q8a3b7 bacteroides
     4     29   100.0     196   2  Q9GNM6       Q9gnm6 ostertagia
     5     29   100.0     196   2  Q9N9C5       Q9n9c5 ostertagia
     6     29   100.0     218   2  Q8A0X0       Q8a0x0 bacteroides
     7     29   100.0     248   2  Q9BS20       Q9bs20 homo sapien
     8     29   100.0     276   2  Q15463       Q15463 homo sapien
     9     29   100.0     278   1  PI28_ARATH   Q9zvx8 arabidopsis
    10     29   100.0     289   2  Q6QF96       Q6qf96 triticum ae
    11     29   100.0     289   2  Q75RZ4       Q75rz4 triticum ae
    12     29   100.0     297   2  Q805L2       Q805l2 saimiriine
    13     29   100.0     297   2  Q80BH5       Q80bh5 saimiriine
    14     29   100.0     297   2  Q80BM8       Q80bm8 saimiriine
    15     29   100.0     297   2  Q80BR3       Q80br3 saimiriine
    16     29   100.0     343   2  Q9C086       Q9C086 homo sapien
    17     29   100.0     345   2  Q99PT3       Q99pt3 mus musculu
    18     29   100.0     356   2  Q96B14       Q96bl4 homo sapien
    19     29   100.0     356   2  Q80ZV5       Q80zv5 mus musculu
    20     29   100.0     366   2  Q6E5N8       Q6e5n8

ALIGNMENTS



RESULT 1
Q972R5
ID Q972R5 PRELIMINARY; PRT; 130 AA.
AC Q972R5;
DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Hypothetical protein ST1069.
GN OrderedLocusNames=ST1069;
OS Sulfolobus tokodaii.
OC Archaea; Crenarchaeota; Thermoprotei; Sulfolobales; Sulfolobaceae;
OC Sulfolobus.
OX NCBI_TaxID=111955;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=JCM 10545 / 7;
RX MEDLINE=21456156; PubMed=11572479;
RA Kawarabayasi Y., Hino Y., Horikawa H., Jin-no K., Takahashi M.,
RA Sekine M., Baba S.-I., Ankai A., Kosugi H., Hosoyama A., Fukui S.,
RA Nagai Y., Nishijima K., Otsuka R., Nakazawa H., Takamiya M., Kato Y.,
RA Yoshizawa T., Tanaka T., Kudoh Y., Yamazaki J., Kushida N., Oguchi A.,
RA Aoki K.-I., Masuda S., Yanagii M., Nishimura M., Yamagishi A.,
RA Oshima T., Kikuchi H.;
RT "Complete genome sequence of an aerobic thermoacidophilic
RT Crenarchaeon, Sulfolobus tokodaii strain7.";
RL DNA Res. 8:123-140(2001).
DR EMBL; AP000984; BAB66099.1; -.
KW Complete proteome; Hypothetical protein.
SQ SEQUENCE 130 AA; 15329 MW; E0ACEE95C0F089BF CRC64;

Query Match 100.0%; Score 29; DB 2; Length 130;
Best Local Similarity 100.0%; Pred. No. 54;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 HKNQT 5
     |||||
Db 3 HKNQT 7

Search completed: April 28, 2005, 12:24:04
Job time : 179 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on:          April 28, 2005, 12:06:47 ; Search time 132 Seconds
                 (without alignments)
                 12.618 Million cell updates/sec

Title:           US-09-992-124C-34
Perfect score:   29
Sequence:        1 HKNQT 5

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        1426032 seqs, 333106140 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries

Database :       Published_Applications_AA:*
                 1: /cgn2_6/ptodata/1/pubpaa/US07_PUBCOMB.pep:*
                 2: /cgn2_6/ptodata/1/pubpaa/PCT_NEW_PUB.pep:*
                 3: /cgn2_6/ptodata/1/pubpaa/US06_NEW_PUB.pep:*
                 4: /cgn2_6/ptodata/1/pubpaa/US06_PUBCOMB.pep:*
                 5: /cgn2_6/ptodata/1/pubpaa/US07_NEW_PUB.pep:*
                 6: /cgn2_6/ptodata/1/pubpaa/PCTUS_PUBCOMB.pep:*
                 7: /cgn2_6/ptodata/1/pubpaa/US08_NEW_PUB.pep:*
                 8: /cgn2_6/ptodata/1/pubpaa/US08_PUBCOMB.pep:*
                 9: /cgn2_6/ptodata/1/pubpaa/US09A_PUBCOMB.pep:*
                 10: /cgn2_6/ptodata/1/pubpaa/US09B_PUBCOMB.pep:*
                 11: /cgn2_6/ptodata/1/pubpaa/US09C_PUBCOMB.pep:*
                 12: /cgn2_6/ptodata/1/pubpaa/US09_NEW_PUB.pep:*
                 13: /cgn2_6/ptodata/1/pubpaa/US10A_PUBCOMB.pep:*
                 14: /cgn2_6/ptodata/1/pubpaa/US10B_PUBCOMB.pep:*
                 15: /cgn2_6/ptodata/1/pubpaa/US10C_PUBCOMB.pep:*
                 16: /cgn2_6/ptodata/1/pubpaa/US10D_PUBCOMB.pep:*
                 17: /cgn2_6/ptodata/1/pubpaa/US10_NEW_PUB.pep:*
                 18: /cgn2_6/ptodata/1/pubpaa/US11_NEW_PUB.pep:*
                 19: /cgn2_6/ptodata/1/pubpaa/US60_NEW_PUB.pep:*
                 20: /cgn2_6/ptodata/1/pubpaa/US60_PUBCOMB.pep:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

ALIGNMENTS

RESULT 1
US-09-867-852-5
Sequence 5, Application US/09867852
Patent No. US20020147324A1
GENERAL INFORMATION:
APPLICANT: Ausubel, Frederick M.
APPLICANT: Staskawicz, Brian J.
APPLICANT: Brent, Andrew F.
APPLICANT: Dahlbeck, Douglas
APPLICANT: Katagiri, Fumiaki
APPLICANT: Kunkel, Barbara N.
APPLICANT: Mindrinos, Michael N.
APPLICANT: Yu, Guo-Liang
TITLE OF INVENTION: RPS2 GENE FAMILY, PRIMERS, PROBES, AND
TITLE OF INVENTION: DETECTION METHODS
FILE REFERENCE: 00786/254002
CURRENT APPLICATION NUMBER: US/09/867,852
CURRENT FILING DATE: 2001-05-29
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 09/301,085
PRIOR FILING DATE: EARLIER FILING DATE: 1999-04-28
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 08/310,912
PRIOR FILING DATE: EARLIER FILING DATE: 1994-09-22
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 08/227,360
PRIOR FILING DATE: EARLIER FILING DATE: 1994-04-13
NUMBER OF SEQ ID NOS: 208
SOFTWARE: FastSEQ for Windows Version 4.0
SEQ ID NO 5
LENGTH: 11
TYPE: PRT
ORGANISM: Arabidopsis thaliana
US-09-867-852-5

Query Match 100.0%; Score 29; DB 9; Length 11;
Best Local Similarity 100.0%; Pred. No. 7.8;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 HKNQT 5
     |||||
Db 5 HKNQT 9

Search completed: April 28, 2005, 12:21:01
Job time : 133 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



April 28, 2005, 12:03:36 ; Search time 41 Seconds 

(without alignments) 
9.104 Million cell updates/sec 



Title: US-09-992-124C-34 
Perfect score: 29 
Sequence: 1 HKNQT 5 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 513545 seqs, 74649064 residues 

Total number of hits satisfying chosen parameters:     48

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries

Database :       Issued_Patents_AA:*
                 1: /cgn2_6/ptodata/1/iaa/5A_COMB.pep:*
                 2: /cgn2_6/ptodata/1/iaa/5B_COMB.pep:*
                 3: /cgn2_6/ptodata/1/iaa/6A_COMB.pep:*
                 4: /cgn2_6/ptodata/1/iaa/6B_COMB.pep:*
                 5: /cgn2_6/ptodata/1/iaa/PCTUS_COMB.pep:*
                 6: /cgn2_6/ptodata/1/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

ALIGNMENTS

RESULT 1
US-08-310-912A-5
Sequence 5, Application US/08310912A
Patent No. 5981730
GENERAL INFORMATION:
APPLICANT: Ausubel, Frederick M.
APPLICANT: Staskawicz, Brian J.
APPLICANT: Brent, Andrew F.
APPLICANT: Dahlbeck, Douglas
APPLICANT: Katagiri, Fumiaki
APPLICANT: Kunkel, Barbara N.
APPLICANT: Mindrinos, Michael N.
APPLICANT: Yu, Guo-Liang
TITLE OF INVENTION: RPS2 GENE FAMILY, PRIMERS, PROBES, AND DETECTION
TITLE OF INVENTION: METHODS
NUMBER OF SEQUENCES: 208
CORRESPONDENCE ADDRESS:
ADDRESSEE: Fish & Richardson P.C.
STREET: 225 Franklin Street
CITY: Boston
STATE: MA
COUNTRY: USA
ZIP: 02110-2904
COMPUTER READABLE FORM:
MEDIUM TYPE: Floppy disk
COMPUTER: IBM PC compatible
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.30B
CURRENT APPLICATION DATA:
APPLICATION NUMBER: US/08/310,912A
FILING DATE: September 22, 1994
CLASSIFICATION: 536
PRIOR APPLICATION DATA:
APPLICATION NUMBER: 08/227,360
FILING DATE: April 13, 1994
ATTORNEY/AGENT INFORMATION:
NAME: Lech, Karen F.
REGISTRATION NUMBER: 35,238
REFERENCE/DOCKET NUMBER: 00786/254001
TELECOMMUNICATION INFORMATION:
TELEPHONE: (617) 542-5070
TELEFAX: (617) 542-8906
TELEX: 1002 54
INFORMATION FOR SEQ ID NO: 5:
SEQUENCE CHARACTERISTICS:
LENGTH: 11 amino acids
TYPE: amino acid
STRANDEDNESS: not relevant
TOPOLOGY: linear
MOLECULE TYPE: protein
US-08-310-912A-5

Query Match 100.0%; Score 29; DB 2; Length 11;
Best Local Similarity 100.0%; Pred. No. 5;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 HKNQT 5
     |||||
Db 5 HKNQT 9

Search completed: April 28, 2005, 12:10:31
Job time : 41 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



April 28, 2005, 11:55:21 ; Search time 164 Seconds 

(without alignments) 
11.791 Million cell updates/sec 



Title: US-09-992-124C-34 
Perfect score: 29 
Sequence: 1 HKNQT 5 

Scoring table: BLOSUM62

Gapop 10.0 , Gapext 0.5

Searched: 2105692 seqs, 386760381 residues

Total number of hits satisfying chosen parameters:     407

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries

Database :       A_Geneseq_16Dec04:*
                 1: geneseqp1980s:*
                 2: geneseqp1990s:*
                 3: geneseqp2000s:*
                 4: geneseqp2001s:*
                 5: geneseqp2002s:*
                 6: geneseqp2003as:*
                 7: geneseqp2003bs:*
                 8: geneseqp2004s:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

SUMMARIES

ALIGNMENTS



RESULT 1
ABP51729
ID ABP51729 standard; peptide; 5 AA.
XX
AC ABP51729;
XX
DT 01-OCT-2002 (first entry)
XX
DE ICAM 1 cell surface protein peptide.
XX
KW Human; epidermal growth factor receptor; EGFR; immunogenic; cancer;
KW apoptosis; cytostatic; cell surface protein.
XX
OS Homo sapiens.
OS Synthetic.
XX
PN WO200247613-A2.
XX
PD 20-JUN-2002.
XX
PF 13-NOV-2001; 2001WO-US047734.
XX
PR 27-NOV-2000; 2000US-00723307.
XX
PA (NOUN ) UNIV NORTHWESTERN.
XX
PI Calenoff E, Ditlow CC;
XX
DR WPI; 2002-566624/60.
XX
PT Candidates for cancer-specific or cancer-associated antigens useful for
PT diagnosis and treatment of cancer comprise synthetic peptides.
XX
PS Disclosure; Page 16; 53pp; English.
XX
CC The present invention describes a method for identifying candidates that
CC are cancer-specific or cancer-associated antigens. The method involves:
CC (1) mapping hydrophilic regions of amino acid sequences; (2) identifying
CC hydrophilic peptide regions that are glycosylated in non-cancerous cells,
CC but deglycosylated in cancer cells; and (3) synthesising, labeling and
CC testing peptides. The cancer-specific/associated antigen sequences have
CC cytostatic activity, and can be used for promoting cancer cell apoptosis.
CC The method can be used for identifying candidates that are cancer-
CC specific or cancer-associated antigens for use in the diagnosis and
CC treatment of cancer. The method involves the use of a site of
CC deglycosylated amino acids in the cancer cells, and so confers a cancer-
CC specific or highly cancer-associated immunogenicity or marker function to
CC the peptide and avoiding self-recognition. The peptides are further not
CC intracellularly expressed, but are located on the cell surfaces
CC (predictably secreted or released into pericellular fluids) in sufficient
CC numbers, and so are sufficiently accessible for targeting T cells; and
CC are present at the earlier stages of cancer progression as well as during
CC later stages; and are retained on the surface of the cancer cells for a
CC time sufficient for the therapeutic T cells to find their target and
CC retain the bound T cells for a time sufficient to affect cancer cell
CC death. ABP51697 to ABP51771 represent amino acid sequence used in the
CC exemplification of the present invention
XX
SQ Sequence 5 AA;

Query Match 100.0%; Score 29; DB 5; Length 5;
Best Local Similarity 100.0%; Pred. No. 1.8e+06;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 HKNQT 5

Db 1 HKNQT 5



RESULT 2
AAO30011
ID AAO30011 standard; peptide; 5 AA.
XX
AC AAO30011;
XX
DT 03-SEP-2003 (first entry)
XX
DE Peptide #19 used to enhance cell secretion.
XX
KW Cellular adhesion; growth; expression; secretion.
XX
OS Unidentified.
XX
PN WO2003044045-A2.
XX
PD 30-MAY-2003.
XX
PF 19-NOV-2002; 2002WO-US037207.
XX
PR 19-NOV-2001; 2001US-00992124.
XX
PA (BECT ) BECTON DICKINSON & CO.
XX
PI Campbell RL, Heidaran M, Spargo CA, Wilkins JH, Haaland PD;
XX
DR WPI; 2003-482396/45.
XX
PT New peptide, useful for enhancing cell growth and/or secretion in a cell
PT culture system.
XX
PS Claim 4; Page 45; 78pp; English.
XX
CC The invention relates to peptides having cell adhesion, growth,
CC expression or secretion-enhancing activities. The peptides of the
CC invention are useful for enhancing cell growth and/or secretion in a cell
CC culture system. The present sequence is a peptide used to enhance cell
CC secretion. Note: This sequence SEQ ID NO:34 is stated to be similar to
CC the sequence shown in page 66 (AAO30049). However these sequences differ
XX
SQ Sequence 5 AA;

Query Match 100.0%; Score 29; DB 6; Length 5;
Best Local Similarity 100.0%; Pred. No. 1.8e+06;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 HKNQT 5

Db 1 HKNQT 5



RESULT 3
AAR80135
ID AAR80135 standard; protein; 11 AA.
XX
AC AAR80135;
XX
DT 20-APR-1996 (first entry)
XX
DE Rps2 disease resistance polypeptide.
XX
KW RPS2; transgenic plant; Pseudomonas syringae plant pathogen;
KW disease resistance; crop improvement; Arabidopsis; tomato; soybean; bean;
KW maize; wheat; rice.
XX
OS Arabidopsis thaliana.
XX
PN WO9528478-A1.
XX
PD 26-OCT-1995.
XX
PF 13-APR-1995; 95WO-US004570.
XX
PR 13-APR-1994; 94US-00227360.
XX
PA (GEHO ) GEN HOSPITAL CORP.
PA (REGC ) UNIV CALIFORNIA.
XX
PI Ausubel FM, Staskawicz BJ, Bent AF, Dahlbeck D, Katagiri F;
PI Kunkel BN, Mindrinos MN, Yu G;
XX
DR WPI; 1995-373794/48.
DR N-PSDB; AAT04798.
XX
PT Pure DNA encoding an Arabidopsis thaliana Rps2 polypeptide - used to
PT express the polypeptide in plant cells to provide disease resistance to
PT pathogens.
XX
PS Claim 6; Page 45; 88pp; English.
XX
CC DNA encoding the Rps2 disease resistance polypeptide (see also AAR80132-
CC 34) can be used to transform a plant cell which is subsequently
CC propagated into a transgenic plant exhibiting resistance to diseases
CC caused by plant pathogens carrying an avirulence gene generating signal
CC recognized by an Rps polypeptide. Specifically, the plant pathogen is
CC Pseudomonas syringae which carries the avrRpt2 avirulence gene (See
CC AAT04799). The plant is a crop plant, especially tomato, soybean, bean,
CC maize, wheat, rice and Arabidopsis
XX
SQ Sequence 11 AA;

Query Match 100.0%; Score 29; DB 2; Length 11;
Best Local Similarity 100.0%; Pred. No. 10;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 HKNQT 5

Db 5 HKNQT 9